An Optimal Processor Replacement Scheme for Efficient Communication of Runtime Data Redistribution

نویسندگان

  • Ching-Hsien Hsu
  • Kun-Ming Yu
  • Chi-Hsiu Chen
  • Chang Wu Yu
چکیده

AbstractDynamic data distribution is used to enhance data locality and algorithm performance with reducing inter-processor communication in data parallel programs on distributed memory multi-computers. Since the exchange of data is performed at run-time, there is a performance tradeoff between the efficiency of the new data decomposition for a subsequent phase of an algorithm and the cost of exchanging data among processors. In this paper, we present an Optimal Processor Replacement (OPR) scheme to minimize data transmission cost for general BLOCK-CYCLIC data redistribution. The main idea of the proposed techniques is to employ a size oriented greedy matching method or a maximum bipartite matching theory for exploring an ideal one-to-one mapping between logical processors. Based on the matching policy, a realigned sequence of destination processors can be derived and is then used to perform data redistribution in the destination phase. A significant improvement of this approach is that OPR achieves the highest rate of data remain in local space and leading minimum inter-processor communication. The optimal processor replacement scheme can handle array redistribution with arbitrary source and destination distribution in BLOCK-CYCLIC type and can be applied to multidimensional arrays. To evaluate the performance of the proposed technique, we have implemented the OPR method on an SMP Cluster with 24 nodes. The theoretical analysis and experimental results show that our technique provides considerable improvement for runtime data redistribution.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An efficient certificateless signcryption scheme in the standard model

Certificateless public key cryptography (CL-PKC) is a useful method in order to solve the problems of traditional public key infrastructure (i.e., large amount of computation, storage and communication costs for managing certificates) and ID-based public key cryptography (i.e., key escrow problem), simultaneously. A signcryption scheme is an important primitive in cryptographic protocols which ...

متن کامل

Improving communication scheduling for array redistribution

Many scientific applications require array redistribution when the programs run on distributed memory parallel computers. It is essential to use efficient algorithms for redistribution, otherwise the performance of the programs will degrade considerably. The redistribution overheads consist of two parts: index computation and inter-processor communication. If there is no communication schedulin...

متن کامل

An Incentive-Aware Lightweight Secure Data Sharing Scheme for D2D Communication in 5G Cellular Networks

Due to the explosion of smart devices, data traffic over cellular networks has seen an exponential rise in recent years. This increase in mobile data traffic has caused an immediate need for offloading traffic from operators. Device-to-Device(D2D) communication is a promising solution to boost the capacity of cellular networks and alleviate the heavy burden on backhaul links. However, dir...

متن کامل

Assessing the cost of redistribution followed by a computational kernel: Complexity and performance results

The classical redistribution problem aims at optimally scheduling communications when reshuffling from an initial data distribution to a target data distribution. This target data distribution is usually chosen to optimize some objective for the algorithmic kernel under study (good computational balance or low communication volume or cost), and therefore to provide high efficiency for that kern...

متن کامل

Contention-Free Communication Scheduling for Irregular Data Redistribution in Parallelizing Compilers

The data redistribution problems on multi-computers had been extensively studied. Irregular data redistribution has been paid attention recently since it can distribute different size of data segment of each processor to processors according to their own computation capability. High Performance Fortran Version 2 (HPF-2) provides GEN_BLOCK data distribution method for generating irregular data d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004